Muscat
Language Model Tokenizers Introduce Unfairness Between Languages
Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different languages can have drastically different tok-enization lengths, with differences up to 15 times in some cases. These disparities persist even for tokenizers that are intentionally trained for multilingual support.
- North America > Haiti (0.14)
- Asia > Philippines > Luzon > Ilocos Region > Province of Pangasinan (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
- (38 more...)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.68)
US and Iran agree to hold nuclear talks in Oman on Friday
The US and Iran have agreed to hold nuclear talks in Oman on Friday, as President Donald Trump issued a blunt warning to Supreme Leader Ayatollah Ali Khamenei. Iranian Foreign Minister Abbas Araghchi said that the meeting would start at 10:00 (06:00 GMT) in Muscat. US officials also confirmed it would happen there. The talks had appeared to be in jeopardy, with the two countries at odds over the location and parameters. Trump has built up US forces in the region and threatened military action if Iran does not agree a deal on its nuclear programme and stop killing protesters.
- Asia > Middle East > Oman > Muscat Governorate > Muscat (0.25)
- North America > Central America (0.15)
- Asia > Middle East > Israel (0.07)
- (17 more...)
- Government > Regional Government > North America Government > United States Government (1.00)
- Government > Regional Government > Asia Government > Middle East Government > Iran Government (1.00)
- Government > Military (1.00)
- Government > Foreign Policy (1.00)
Energy-Efficient Federated Learning via Adaptive Encoder Freezing for MRI-to-CT Conversion: A Green AI-Guided Research
Raggio, Ciro Benito, Migliorelli, Lucia, Skupien, Nils, Zabaleta, Mathias Krohmer, Blanck, Oliver, Cicone, Francesco, Cascini, Giuseppe Lucio, Zaffino, Paolo, Spadea, Maria Francesca
Federated Learning (FL) holds the potential to advance equality in health by enabling diverse institutions to collaboratively train deep learning (DL) models, even with limited data. However, the significant resource requirements of FL often exclude centres with limited computational infrastructure, further widening existing healthcare disparities. To address this issue, we propose a Green AI-oriented adaptive layer-freezing strategy designed to reduce energy consumption and computational load while maintaining model performance. We tested our approach using different federated architectures for Magnetic Resonance Imaging (MRI)-to-Computed Tomography (CT) conversion. The proposed adaptive strategy optimises the federated training by selectively freezing the encoder weights based on the monitored relative difference of the encoder weights from round to round. A patience-based mechanism ensures that freezing only occurs when updates remain consistently minimal. The energy consumption and CO2eq emissions of the federation were tracked using the CodeCarbon library. Compared to equivalent non-frozen counterparts, our approach reduced training time, total energy consumption and CO2eq emissions by up to 23%. At the same time, the MRI-to-CT conversion performance was maintained, with only small variations in the Mean Absolute Error (MAE). Notably, for three out of the five evaluated architectures, no statistically significant differences were observed, while two architectures exhibited statistically significant improvements. Our work aligns with a research paradigm that promotes DL-based frameworks meeting clinical requirements while ensuring climatic, social, and economic sustainability. It lays the groundwork for novel FL evaluation frameworks, advancing privacy, equity and, more broadly, justice in AI-driven healthcare.
- Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.05)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- Europe > Germany > Schleswig-Holstein (0.04)
- (4 more...)
- Health & Medicine > Nuclear Medicine (0.94)
- Health & Medicine > Health Care Providers & Services (0.67)
- Health & Medicine > Diagnostic Medicine > Imaging (0.66)
- Health & Medicine > Therapeutic Area > Oncology (0.47)
FDRMFL:Multi-modal Federated Feature Extraction Model Based on Information Maximization and Contrastive Learning
This study focuses on the feature extraction problem in multi-modal data regression. To address three core challenges in real-world scenarios: limited and non-IID data, effective extraction and fusion of multi-modal information, and susceptibility to catastrophic forgetting in model learning, a task-driven supervised multi-modal federated feature extraction method is proposed. The method integrates multi-modal information extraction and contrastive learning mechanisms, and can adapt to different neural network structures as the latent mapping functions for data of each modality. It supports each client to independently learn low-dimensional representations of multi-modal data, and can flexibly control the degree of retention of effective information about the response variable in the predictive variables within the low-dimensional features through parameter tuning. The multi-constraint learning framework constructed by the method guarantees regression accuracy using Mean Squared Error loss. Through the synergistic effect of mutual information preservation constraint, symmetric Kullback-Leibler divergence constraint, and inter-model contrastive constraint, it achieves the retention of task-related information, the extraction, fusion, and alignment of multi-modal features, and the mitigation of representation drift and catastrophic forgetting in non-IID scenarios, respectively. This ensures that the feature extraction process always centers on improving the performance of downstream regression tasks. Experimental results from simulations and real-world data analysis demonstrate that the proposed method achieves more significant performance improvement on downstream regression tasks compared with classical feature extraction techniques.
- Asia > China > Hong Kong (0.04)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East > Oman > Muscat Governorate > Muscat (0.04)
A long lost silver dollar may be worth 5 million
The'King of American Coins' remained hidden in a late collector's archive for decades. Breakthroughs, discoveries, and DIY tips sent every weekday. One of the country's rarest coins is rarer than even expert coin collectors believed. After the surprise discovery of a long-lost 1804 dollar (aka the " King of American Coins "), the rarity's total known count now stands at 16. Regardless of its ranking, the silver coin is expected to fetch significantly more than its original worth when it hits the auction block on December 9. According to auctioneers at Stack's Bowers Galleries, the story begins with former President Andrew Jackson.
- Asia > Middle East > Oman > Muscat Governorate > Muscat (0.06)
- North America > United States > New York (0.05)
- North America > United States > California (0.05)
- (2 more...)
From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation
Zhang, Zhihao, Zhang, Yiran, Zhou, Xiyue, Huang, Liting, Razzak, Imran, Nakov, Preslav, Naseem, Usman
Infodemics and health misinformation have significant negative impact on individuals and society, exacerbating confusion and increasing hesitancy in adopting recommended health measures. Recent advancements in generative AI, capable of producing realistic, human like text and images, have significantly accelerated the spread and expanded the reach of health misinformation, resulting in an alarming surge in its dissemination. To combat the infodemics, most existing work has focused on developing misinformation datasets from social media and fact checking platforms, but has faced limitations in topical coverage, inclusion of AI generation, and accessibility of raw content. To address these issues, we present MM Health, a large scale multimodal misinformation dataset in the health domain consisting of 34,746 news article encompassing both textual and visual information. MM Health includes human-generated multimodal information (5,776 articles) and AI generated multimodal information (28,880 articles) from various SOTA generative AI models. Additionally, We benchmarked our dataset against three tasks (reliability checks, originality checks, and fine-grained AI detection) demonstrating that existing SOTA models struggle to accurately distinguish the reliability and origin of information. Our dataset aims to support the development of misinformation detection across various health scenarios, facilitating the detection of human and machine generated content at multimodal levels.
- North America > United States > Washington > King County > Seattle (0.04)
- North America > United States > Utah > Summit County > Park City (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- (8 more...)
- Media > News (1.00)
- Health & Medicine > Therapeutic Area > Immunology (0.95)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.95)
- Government > Regional Government > North America Government > United States Government (0.68)
- Oceania > Australia > Victoria > Melbourne (0.04)
- North America > United States > South Carolina (0.04)
- Asia > Middle East > Oman > Muscat Governorate > Muscat (0.04)
- (2 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology (1.00)
- Government > Voting & Elections (0.67)
- Media > News (0.53)
- Government > Regional Government > North America Government > United States Government (0.45)
- North America > United States > Michigan (0.04)
- Europe > Sweden > Uppsala County > Uppsala (0.04)
- Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
- (4 more...)
Speaking at the Right Level: Literacy-Controlled Counterspeech Generation with RAG-RL
Song, Xiaoying, Anik, Anirban Saha, Barua, Dibakar, Luo, Pengcheng, Ding, Junhua, Hong, Lingzi
Health misinformation spreading online poses a significant threat to public health. Researchers have explored methods for automatically generating counterspeech to health misinformation as a mitigation strategy. Existing approaches often produce uniform responses, ignoring that the health literacy level of the audience could affect the accessibility and effectiveness of counterspeech. We propose a Controlled-Literacy framework using retrieval-augmented generation (RAG) with reinforcement learning (RL) to generate tailored counterspeech adapted to different health literacy levels. In particular, we retrieve knowledge aligned with specific health literacy levels, enabling accessible and factual information to support generation. We design a reward function incorporating subjective user preferences and objective readability-based rewards to optimize counterspeech to the target health literacy level. Experiment results show that Controlled-Literacy outperforms baselines by generating more accessible and user-preferred counterspeech. This research contributes to more equitable and impactful public health communication by improving the accessibility and comprehension of counterspeech to health misinformation
- North America > United States > Texas (0.14)
- North America > United States > Maryland (0.04)
- Asia > Middle East > Israel (0.04)
- (3 more...)
- Media > News (1.00)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- (3 more...)
- Asia > Middle East > Jordan (0.04)
- North America > Canada (0.04)
- Europe > Germany > Berlin (0.04)
- Asia > Middle East > Oman > Muscat Governorate > Muscat (0.04)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Health & Medicine > Health Care Technology (1.00)